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ABSTRACT 


In  this  repart  we  try  ta  establish  an  upper  bound  an  the  information  capacity  af  an 
electramognetic  wave  propagation  circuit  between  two  apertures  when  quantum 
effects  came  into  play.  We  first  discuss  the  specifications  of  a  signal,  and  find 
that  we  con  associate  approximately  BT  quontities  with  a  signal  of  bandwidth  B 
and  duration  T,and  thot  the  energy  assacioted  with  these  quontities  must  differ  by 
at  least  Titj.  We  then  consider  radiotive  attenuotian  lass  within  the  framework  af 
quantum  theory  and  find  that  the  radiative  attenuation  introduces  a  random  naise 
in  the  circuit  which  is  analogous  to  partition  tube  noise. 

With  the  channel  matrix  determined,  we  discuss  the  problem  of  evaluating  the 
channel  capacity  under  overage  ond  peak  power  constroints.  Due  to  mathematical 
difficulties  we  are  only  able  ta  establish  the  channel  capacity  explicitly  when  the 
radiative  losses  become  extreme.  In  this  limiting  case  —  os  a  by-product  —  we  also 
find  that  the  best  encoding  procedure  is  a  binary  on  —  off  system. 
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INFORMATION  CAPACITY  AND  QUANTUM  EFFECTS 
IN  PROPAGATION  CIRCUITS 


I.  INTRODUCTION 

The  recent  development  of  various  types  of  coherent  oscillators  and  amplifiers  in  the  infrared 
and  the  optical  frequency  regions  seems  to  bear  great  promise  for  certain  communication  and 
radar  applications.  It  is  immediately  obvious  that  quantum  effects  will  become  of  considerable 
importance  in  these  frequency  regions  because  of  the  proportionality  of  quantum  energy  and 
frequency.  Any  discussion  of  communication  and  information  capacity  at  these  frequencies  must 
therefore  take  into  account  the  full  effects  of  the  quantized  energy  of  the  waves. 

Some  work  has  been  done  on  quantum  effects  in  communication  systems,  usually  with  empha¬ 
sis  on  the  behavior  of  the  various  elements  at  the  terminals  of  the  system.  In  particular,  the 

problems  relating  to  the  receiving  end  and  the  relative  merits  of  quantum  counters,  linear  ampli- 

1-3 

fiers,  etc.,  have  been  widely  discussed. 

Effects  relating  to  the  transmission  path  itself  have  received  much  less  attention.  Recent 
4  5 

works  by  Heffner  and  Haus  have  reported  certain  progress  along  this  line.  However,  we  want 
to  consider  further  the  fundamental  problem  of  the  transmission  of  information  on  quantized  elec¬ 
tromagnetic  waves  from  a  transmitting  aperture  A  to  a  receiving  aperture  B  as  shown  in  Fig.  1. 
Because  the  actual  propagation  path  is  our  main  interest,  we  shall  only  discuss  the  relation  be¬ 
tween  the  information  which  is  encoded  on  the  electromagnetic  wave  passing  out  of  the  transmit¬ 
ting  aperture  and  the  information  which  is  present  on  the  wave  passing  into  the  receiving  aperture 
Questions  relating  to  the  encoding  and  decoding  procedures  will  be  left  aside  in  this  report.  Our 
problem  is  one  of  information  transmission  from  aperture  to  aperture. 

The  first  question  that  arises  is  concerned  with  the  amount  of  information  which  can  be  car¬ 
ried  by  an  electromagnetic  wave  of  given  bandwidth  and  center  frequency  for  a  single  polarization. 
In  an  early  paper  Stern^  assumed  that  a  message  of  bandwidth  B  and  duration  T  could  be  speci¬ 
fied  by  2TB  numbers,  as  in  the  classical  picture.  In  a  later  work  Gordon^  argued  that  only  half 


]373l-8437  1 


TRANSMITTER 

ENCODER 


RECEIVER 

DECODER 


n 


REGION  OF  CONCERN 
IN  THIS  REPORT 


INFORMATION 

INPUT 


INFORMATION 

OUTPUT 


Fig.  1,  Communication  system. 
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as  many  numbers  could  be  associated  with  a  message.  The  reason  for  this  is  intuitively  clear: 
The  classical  specification  of  a  bandpass  limited  signal  involves  the  in-phase  and  the  quadrature 
components  at  a  rate  of  B.  In  a  quantum  mechanical  formulation,  however,  these  two  compo¬ 
nents  must  be  regarded  as  noncommuting  variables  and  only  one  of  them  can  be  specified  exactly. 
In  this  report  it  is  pointed  out  that  even  Gordon's  ideas  about  the  number  of  degrees  of  freedom 
of  a  message  might  be  somewhat  optimistic. 

In  previous  work  the  "wave  capacity"  was  derived  for  a  plane  wave  only.  This  derivation 
does  not  leave  room  for  considerations  of  radiative  attenuation  effects  and  therefore  cannot  be 
applied  directly  to  practical  communication  situations.  A  transmitting  aperture  can  be  made  to 
excite  several  independent  spatial  modes,  none  of  which  are  plane  waves.  The  distinction  be¬ 
tween  plane  and  spherical  waves  is  unimportant  in  certain  respects.  But  it  must  be  remembered 
that  the  receiving  antenna  usually  couples  weakly  into  one  of  the  spatial  modes  set  up  by  the  trans¬ 
mitting  antenna  -  unless  the  transmitter-receiver  separation  is  very  small.  This  weak  coupling 
between  the  two  apertures  gives  rise  to  a  phenomenon  which  can  be  interpreted  as  partition  noise. 
It  will  be  shown  that  this  noise  has  a  rather  profound  effect  on  the  capacity  of  the  quantum  circuit, 
particularly  when  the  received  power  level  is  exceedingly  low.  In  Gordon's  work,  the  sequence 
of  quantum  numbers  specified  at  the  transmitting  end  reappears  as  the  same  sequence  of  numbers 
at  the  receiving  end  because  of  the  lack  of  partition  noise.  When  partition  noise  is  important, 
however,  the  specification  of  a  sequence  of  quantum  numbers  at  the  transmitter  can  only  serve 
to  define  a  set  of  distribution  functions  for  the  quantum  numbers  at  the  receiving  end,  and  hence, 
even  in  the  absence  of  additive  noise,  the  channel  must  be  described  in  terms  of  a  matrix  which 
exhibits  noise  properties. 

In  what  follows,  we  first  discuss  the  specification  of  a  signal  or  a  message  fed  into*  a  single 
spatial  mode  (single  mode  includes  only  one  polarization).  We  then  go  on  to  discuss  the  concept 
of  spatial  modes  and  evaluate  the  number  of  such  modes  which  can  be  excited  by  a  given  trans¬ 
mitting  aperture.  The  coupling  between  the  transmitting  and  the  receiving  apertures  is  then  es¬ 
tablished  in  such  a  way  that  quantization  of  the  system  can  be  carried  out  rather  easily.  This 
quantization  procedure  leads  to  a  specification  of  the  communication  channel  in  terms  of  a  chan¬ 
nel  matrix.  The  form  of  this  channel  matrix  is  also  worked  out  for  the  case  of  additive  noise 
being  present. 

With  the  form  of  the  channel  matrix  established,  the  channel  capacity  is  determined  through 
a  familiar  variational  procedure.  It  is  shown  that  this  procedure  breaks  down  when  partition 
noise  is  present.  In  fact,  the  solution  of  the  variational  problem  leads  to  certain  negative  input 
probabilities  for  the  optimum  encoding  procedure.  This  situation  has  previously  been  considered 

g 

by  Muroga  who  has  shown  that  certain  of  the  input  "symbols"  (here  quantum  states)  must  be 
omitted.  As  there  does  not  seem  to  be  other  than  a  trial-and-error  procedure  to  the  solution  of 
the  capacity  problem  in  this  case,  we  are  unable  to  establish  the  capacity  of  the  channel  except 
in  the  case  of  extreme  radiative  attenuation  when  the  channel  becomes  binary.  In  spite  of  the 
lack  of  a  general  result  for  the  channel  capacity,  it  is  possible  to  draw  certain  conclusions  about 
the  type  of  encoding  procedure  which  must  be  employed  to  optimize  the  transfer  of  information. 

II.  SPECIFICATION  OF  A  SIGNAL 

In  information  theory,  a  continuous  signal  is  shown  to  be  equivalent  to  a  certain  number  of 

degrees  of  freedom  determined  by  the  duration,  T,  and  the  bandwidth,  B,  of  the  signal.  The 
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number  of  degrees  of  freedom  is  shown  to  be  approximately  equal  to  2TB.  ’  It  is  understood 


2 


that  either  B  or  T  can  be  sharply  defined,  not  both  simultaneously.  There  are  several  ways 
in  which  2TB  quantities  can  be  associated  with  a  signal,  the  most  familiar  ones  being  by  means 
of  sampled  values  either  in  the  time  domain  or  in  the  frequency  domain. 

A  bandpass  limited  time  function,  f(t),  can  be  expressed  as 

f(t)  =  f  (t)  •  cos  u>  t  +  f  (t)  sin  (jo  t  ( 1) 

'  '  c  o  s  o 


where  w  o  is  the  center  frequency  of  the  pass  band,  and  where  fc(t)  and  fg (t)  are  time  functions 
whose  spectra  differ  from  zero  only  when  |  u>  |  <  ttB,  where  B  is  the  signal  bandwidth.  The  two 
functions  f^(t)  and  fg(t)  may  be  represented  as 
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7rB(t  -  tk) 
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k 


k 

B 


(2) 


and  where  f  ,  and  f  ,  are  the  values  of  f  (t)  and  of  f  (t),  respectively,  at  the  times  t  =  t.  .  At 
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each  sampling  point  it  is  therefore  possible  to  specify  two  quantities,  and  hence  2B  quantities 
per  unit  time.  For  a  message  of  approximate  duration  T,  the  number  of  quantities  required 
to  specify  the  complete  signal  is  therefore  2TB.  It  should  be  noted  that  the  particular  sampling 
functions  used  in  (2)  ensure  that  the  sampling  functions  in  the  sums  are  completely  independent 
at  the  sampling  points. 

A  similar  situation  exists  when  we  consider  sampling  in  the  frequency  domain.  A  strict  lim¬ 
itation  of  the  signal  in  time  leads  to  a  sampling  theorem  for  the  frequency  domain.  In  this  case 
it  turns  out  to  be  possible  to  specify  2T  independent  quantities  (i.e.,  amplitude  and  phase)  per 
unit  bandwidth  and  therefore  2TB  quantities  in  all. 

When  we  turn  to  a  quantum  approach  to  the  signal  description,  we  have  to  make  sure  that  we 
are  actually  dealing  with  observable  quantities.  In  the  classical  theory  it  is  possible  to  devise 
"thought  experiments"  whereby  the  sampled  values  actually  appear  as  observables.  This,  how¬ 
ever,  requires  measuring  quantities  with  infinitesimal  energy,  and  the  sampling  procedure 
therefore  does  not  appear  to  be  directly  applicable  in  quantum  theory.  When  making  an  observa¬ 
tion  in  the  quantum  sense,  there  must  always  be  a  finite  energy  associated  with  the  observed 
quantity.  For  this  reason  we  have  to  replace  the  sampled  values  by  some  sort  of  average  over 
a  space  or  a  time  interval. 

One  way  to  overcome  this  difficulty  is  to  interpret  the  time -bandwidth  product  in  a  more  lax 
way  than  is  usual  in  the  classical  signal  theory,  and  assume  that  a  signal  of  approximate  duration 
T  is  represented  in  the  following  manner 


f(t)  = 


cos  Q  t  +  b^ 


sin  u^t) 


(3) 


where  the  increments  in  are  given  by  (2tt/T).  With  this  particular  choice  of  signal  represen¬ 
tation,  we  see  that  the  spectra  of  the  various  elementary  signals  corresponding  to  different  val¬ 
ues  of  £  no  longer  overlap,  and  we  have  therefore  been  able  to  produce  a  set  of  functions  which, 
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even  when  observed  in  a  quantum  sense,  do  not  have  to  interfere.  The  time  function  represented 
by  (3)  is  sharply  defined  in  the  frequency  domain  where  the  upper  and  the  lower  frequency  limits 
are 


f 

max 


1  1 
=  T  ^2  +  2^ 


f  . 
min 


(4) 


Introducing  the  signal  bandwidth  through  B  =  fmax  —  ^min’  we  t^iat  number  °f  independent 
elementary  signals  which  can  be  associated  with  the  total  signal  is 

N  =  B  •  T 


Again  each  of  the  elementary  signals  must  be  specified  by  two  quantities,  such  as  a^  and  b^  in 
(3),  and  the  number  of  degrees  of  freedom  of  the  total  signal  is  the  same  as  in  the  conventional 
theory.  In  the  type  of  representation  in  (3),  however,  the  fraction  of  the  power  which  lies  outside 
of  the  nominal  time -bandwidth  region  BT  is  given  by 


dx  =  °-°97 

X 


(5) 


In  the  conventional  sampling  procedure,  this  factor  would  be  reduced  by  another  factor  of  the 
order  of  l/BT.  The  definition  of  the  signal  in  the  way  suggested  by  (3)  therefore  makes  elemen¬ 
tary  signals  ill-defined  in  the  time  domain.  A  slight  improvement  appears  to  be  possible  by  using 
instead  of  the  sinx/x  modulation  a  prolate  spheroidal  wave  function.  This  refinement  will  make 
very  little  difference  in  our  conclusions  and  we  shall  avoid  introducing  this  complication. 

One  of  several  alternative  ways  of  defining  noninteracting  elementary  signals  is  to  make 
them  sharply  defined  in  time  and  then  construct  a  composite  signal  from  a  sequence  of  elementary 
signals.  In  this  case  the  frequency  definition  would  be  poor  and  the  energy  associated  with  ele¬ 
mentary  signals  would  be  uncertain.  In  this  report  we  shall  find  (3)  most  convenient  as  a  de¬ 
scription  of  an  elementary  signal. 

Having  defined  a  set  of  elementary  signals,  we  must  next  inquire  about  their  description  in 
the  language  of  quantum  theory.  In  particular  we  must  ask  about  the  accuracy  with  which  the  a^ 
and  the  b^  can  be  specified,  and  whether  they  can  be  specified  independently.  For  this  purpose 
we  imagine  that  the  time  function  (3)  has  excited  a  single  spatial  mode  which,  for  simplicity,  we 
take  to  be  a  plane  wave  mode  with  direction  of  propagation  along  the  z-axis.  Because  of  our 
choice  of  elementary  signals,  all  quantities  such  as  Lagrangians  and  Hamiltonians  can  be  ex¬ 
pressed  as  sums  over  contributions  from  the  elementary  signals.  From  now  on,  therefore,  we 
omit  the  subscripts  l,  and  use  cjo  for  the  center  frequency  of  the  particular  elementary  signal 
under  consideration.  The  detailed  quantization  of  elementary  signals  is  carried  out  in  Appen¬ 
dix  A.  An  elementary  signal  is  assumed  to  be  modulated  onto  a  plane  wave  traveling  in  the 
z-direction  and  polarized  along  the  x-axis.  In  Appendix  A  the  wave  is  represented  by  the  x- 
component  of  the  vector  potential  in  the  following  form 


A  (z,  t) 

x'  ' 


sin  "T  *  c  ~  ^ 

2-(z  _t) 

T  '  c  11 


[a  cos  wo(-|  -  t )  -0  sin  ajQ(  f  -  t)  ] 


(6) 
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In  spite  of  the  fact  that  this  is  a  nonharmonic  signal,  it  is  shown  that  the  two  amplitude  factors 
a  and  fi ,  which  contain  all  the  information  about  an  elementary  signal,  can  be  combined  into 
complex  amplitudes  a  and  a*  which  obey  the  same  commutation  relations  as  the  complex  ampli¬ 
tudes  of  an  harmonic  oscillator.  Provided  the  detection  of  the  elementary  signals  is  carried  out 
the  right  way,  it  is  shown  that  a  and  p  can  be  specified  only  to  an  accuracy  corresponding  to 
one  quantum  of  energy 

We  have  therefore  shown  that  a  signal  of  approximate  time -bandwidth  product  BT  =  N  can 

be  specified  by  N  degrees  of  freedom,  each  with  a  minimum  energy  difference  corresponding  to 

one  quantum  ficu  .  This  conclusion  appears  to  be  in  complete  agreement  with  the  assumption  made 
7  ° 

by  Gordon  concerning  the  number  of  states  which  can  be  associated  with  a  signal. 

III.  SPATIAL  MODES  AND  PARTITION  EFFECTS 

In  the  previous  section  we  discussed  the  quantization  of  a  signal  impressed  on  a  single  plane 
wave  mode.  In  practice  it  is  not  possible  to  excite  plane  wave  modes  from  an  antenna  of  finite 
extent,  and  we  must  examine  the  nature  of  the  spatial  modes  which  can  be  excited  by  an  aperture 
of  finite  extent. 

Again  we  imagine  that  the  fields  are  described  by  a  vector  potential  satisfying  the  wave 
equation: 


(7) 


and 


VA  =  0 


(8) 


It  follows  that  a  complete  specification  of  the  transverse  components  of  A  and  their  normal  de¬ 


rivatives  over  an  aperture  will  completely  specify  the  fields  in  front  of  the  aperture  plane.  For 
simplicity,  we  take  the  transmitting  aperture  to  be  in  the  xy-plane  and  to  be  rectangular  (Fig.  2). 
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Fig.  2.  Transmitting  aperture  over  which  fields  are  prescribed. 
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We  now  imagine  that  the  transverse  components  of  the  vector  potential  are  specified  over 
the  aperture,  and  that  the  time  variation  of  the  components  is  in  agreement  with  that  of  one  of 
the  elementary  signals  discussed  in  the  previous  section.  A  general  representation  of  the  x 
and  the  y  components  of  the  vector  potential  in  the  half-space  z  >  0  is 

—  12  1 

A  (r,  t)  -  ( zz~)  -xr  \  dk  \  \  dk  dk  {A  (k  k  k)  exp  [i(k  r  ket)  ] 

x, y  '2*'  Jk  _(Ak/2)  x  yx  x  y 


+  A^y(kxkyk)  exp[-i(kr-kct)]}  .  (9) 

The  field  on  the  aperture  is  found  from  this  by  putting  z  -  0. 

.  n  k  +(Ak/2) 

A  (x,  y,  o,  t)  =  -£r  \  °  dk{B  (x,  y,  k)  exp[— iket) 

x-y  jko_(Ak/2)  x.y 


+  B*  (x,  y,  k)  exp[+ikct]}  ,  (10) 

where  the  quantities  A^  y(k^kyk)  are  related  to  the  y(x,  y,  k)  through  the  relations 


A  (k  k  k)  = 
x,y  x  y 


dxdy  B  (x,  y,  k)  exp  [-i(xk  + 
x,  y  x 


yky)( 


aperture 


(11) 


Because  the  fields  over  the  transmitting  aperture  contain  no  structure  larger  than  t  in  the  x- 

direction  and  l  in  the  y-direction,  it  follows  that  A^  y(kxkyk)  is  completely  specified  provided 

it  is  given  at  certain  particular  values  of  k  and  k  given  by 

x  y 


27rr 

~T~ 


r  =  0,  ±1,  ±2,  .  .  . 


27TS 

£ 

y 


s  =  0,  ±1,  ±2,  .  .  . 


In  analogy  with  the  sampling  theorem  of  Shannon  or  the  aperture  synthesis  procedure  introduced 
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by  Woodward  and  Lawson,  we  therefore  may  put 


A  (k  k  ) 
x,y  x  y 


y  y  A  (hi  2™\  _ 

L  L  x,y  \  l  *  l  }  1 


sinl  fx(kx  ~  T~r)  sinYly(ky-  rs) 


2  fx 


(12) 


where  we  have  omitted  k  in  the  argument  of  A^  because  the  field  amplitudes  are  independent 
of  frequency  within  the  frequency  range  of  an  elementary  signal.  Only  an  enumerably  infinite 
set  of  modes  can  therefore  be  excited  by  an  aperture  of  limited  extent.  Of  these,  the  modes 
corresponding  to 

(^r)Z+  (Zp)2  >k2  (13) 


will,  however,  not  be  able  to  propagate  away  from  the  aperture.  They  correspond  to  evanescent 
waves.  This  leaves  only  a  finite  number  of  possible  propagating  modes.  These  will  be  the  only 
ones  of  interest  when  it  comes  to  discussion  of  information  transfer  between  distant  apertures. 
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It  is  instructive  to  determine  the  number  of  independent  spatial  modes  corresponding  to 
traveling  waves  which  can  be  excited  by  the  transmitting  aperture.  The  solid  angle  occupied  by 
each  spatial  mode  is  given  by 


Aft 


47 r 


mode 


f  f  k  cos  0 
x  y  o  r,s 


(14) 


where  0^  is  the  angle  between  and  the  z -direction.  The  number  of  propagating  modes  per 

solid  angle  is  the  inverse  of  this,  and  the  total  number  of  propagating  modes  therefore  becomes 


N 


=£ 


d(ft) 


semi-space 


Aft 


mode 


i  i 

*  y 

47T 


(15) 


which  is  seen  to  be  one -quarter  of  the  maximum  gain  of  the  aperture.  Each  of  these  spatial  modes 
can  of  course  support  two  orthogonal  polarizations  of  the  waves  (circular,  linear,  or  other  types). 

On  the  transmitting  end  we  can  therefore  excite  only  a  limited  number  of  spatial  modes,  each 
in  two  different  orthogonal  polarizations.  The  elementary  signals  impressed  on  the  various  spa¬ 
tial  modes  can  be  specified  in  exactly  the  same  manner  as  was  done  for  the  plane  waves  of  the 
previous  section.  The  artificial  limiting  of  the  cross  section  of  the  plane  wave  is  now  replaced 
by  a  natural  limitation  caused  by  the  finite  lateral  extent  of  each  of  the  spatial  modes. 

Next  we  must  discuss  the  coupling  of  the  receiving  aperture  into  the  fields  excited  by  the 
transmitting  antenna.  We  first  carry  through  a  discussion  along  completely  classical  lines  be¬ 
fore  we  go  on  to  a  quantum  description  of  the  phenomena. 

Denote  the  Poynting  vector  set  up  by  a  particular  transmitting  mode  specified  by  (r,  s,  p)  by 
The  parameter  p  signifies  the  state  of  polarization,  and  the  parameters  r,  s  the  direction. 
The  maximum  power  which  can  be  absorbed  by  the  receiving  antenna  becomes 


(r\s) 


^  dx'dy'  n  •  g^(x',y') 

aperture 


(16) 


The  integration  extends  over  the  whole  of  the  receiving  aperture;  n  is  the  normal  to  the  aperture 
surface  and  g^(x!,y!)  is  a  weight  factor  relating  to  polarization  and  to  the  particular  type  of  an¬ 
tenna  employed.  The  total  power  put  into  the  (r,  s,  p)  mode  by  the  transmitter  (index  t)  is  of  the 
form 


(r,s) 


S,M  . 

(r,s) 


da 


(17) 


We  can  now  introduce  the  ratio  of  aperture  power  to  total  power  for  each  spatial  mode  (r,  s,  p) 


,>>  = 
r,s 


p(M) 

rs 

p  ( M-)  t 


rs 


(18) 


This  number  of  course  is  always  less  than  or  equal  to  unity.  When  one  or  more  of  the  r's  is 

equal  to  unity,  power  may  be  transferred  from  the  transmitting  to  the  receiving  aperture  without 
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loss.  This  is  essentially  what  is  implied  in  most  earlier  work  on  quantum  communication. 

In  order  to  see  when  such  close  coupling  between  transmitting  and  receiving  apertures  is 
possible,  we  may  argue  as  follows:  The  opening  angle  corresponding  to  each  spatial  mode  is 
given  approximately  by 
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(19) 


A£2 


a  cos  6t 


where  X  is  the  wavelength,  the  transmitting  aperture  area  and  0^  the  angle  between  the  beam 
direction  and  the  normal  to  the  aperture.  The  area  of  the  beam  at  the  receiver,  located  a  dis¬ 
tance  R  away  from  the  transmitter,  becomes 


beam 


Afi  •  R 


(20) 


If  this  area  is  smaller  than  the  projected  area  of  the  receiving  aperture,  i.e.,  a  ^  cos  0^,  then  a 
complete  power  transfer  can  take  place.  The  condition  for  this  is  seen  to  be 


ra~a  cos  O  cos  0, 

R  <  J-t-E - ^ 1  '  (21) 

2  ° 

As  an  illustration,  consider  the  following  numerical  example:  =  1cm  ,  X  =  6000  A.  Sub¬ 

stitution  into  the  inequality  shows  that  R  is  less  than  166  m.  In  any  practical  communication 
circuit,  therefore,  we  must  be  prepared  to  use  only  one  of  the  possible  spatial  modes  which  can 
be  excited  at  the  transmitter,  and  we  must  also  take  full  account  of  radiative  attenuation  effects 
in  the  quantum  description  of  the  circuit.  As  we  shall  see,  this  leads  us  to  a  partition  noise  ef¬ 
fect  which  will  cause  a  serious  decrease  in  information  capacity  of  a  communication  channel  when 

the  received  power  is  extremely  low.  Such  partition  effects  have  been  described  previously  by 
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Haus  and  it  is  implicit  in  some  work  by  Heffner. 

Let  us  next  try  to  study  the  relation  between  the  transmitted  and  the  received  signal  in  terms 

of  the  ideas  developed  above.  We  consider  only  one  particular  transmitted  mode,  namely  that 

corresponding  to  constant  phase  across  the  aperture,  only  one  of  the  two  possible  polarizations, 

and  one  of  the  elementary  signals  corresponding  to  the  center  frequency 

The  fields  over  a  distant  plane  z  =  R  in  front  of  the  transmitting  antenna,  and  R  denotes  a 

point  on  this  plane.  The  fields  over  this  reference  plane,  or  receiving  aperture  plane,  are 

,  y  .  pk  +( Ak/  2)  pp 

=  AkJk°.(Ak  2)  dkii  dkxdkyFt(kxky){Btexp[i(kR-kct)l 


+  B*  exp  [— i(kR  -  kct)  ]}  =  B^R.t)  +  B*g*(R,t)  . 

Here  B  is  the  mode  amplitude;  see  Eqs.  (11)  and  (12).  F  (k  k  )  is  defined  by 
t  t  x  y 


(22) 


F  (k  k  ) 
t  x  y' 


sin  |  k  t  sin  t  k  ( 

2  x  x  2  y  y 


J  k  1 
2  xx 


T  k  * 

2  y  y 


(23) 


and  the  functions  g^(R,  t)  and  g^(R,t)  are  just  shorthand  for  the  integrals  in  (22).  Now  the  fields 
in  all  space  can  also  be  derived  from  a  knowledge  of  the  fields  over  the  receiving  aperture  plane 
z  =  R. 

However,  we  divide  the  receiving  aperture  plane  into  two  parts:  one  part  coinciding  with  the 
receiving  aperture,  and  the  other  part  consisting  of  the  rest  of  the  aperture  plane  (Fig.  3).  The 
new  description  of  the  fields  therefore  contains  two  terms,  one  corresponding  to  either  of  the  two 
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Fig.  3.  Transmitting  and  receiving  apertures  with  receiving  aperture  plane. 


parts  of  the  receiving  aperture  plane.  In  the  new  representation  with  two  terms,  we  therefore 
obtain  expressions  of  the  form 


(24) 


which  corresponds  to  the  receiving  aperture  mode,  and  a  similar  expression  for  A^(r,  t)  corre¬ 
sponding  to  the  mode  derived  from  the  rest  of  the  receiving  aperture  plane.  We  are  now  in  the 
position  to  quantize  the  system  and  we  interpret  the  parameters  B^,  B^  and  B^  [corresponding 
to  A^(r,  t)]  as  operators;  and,  furthermore,  introduce  creation  and  annihilation  operators  as  in 
Appendix  A.  The  following  equations  between  the  various  operators  are  obtained: 


=  p*  a*  +  P? a? 
t  r  r  l  i 


(25) 


The  parameters  and  P^  can  be  derived  directly  from  the  above  equations  (Appendix  B).  They 
can,  however,  also  be  derived  from  physical  arguments  as  we  shall  show  shortly.  In  what  fol¬ 
lows,  we  refer  to  the  transmitted  signal  system  as  the  t-system,  the  system  derived  from  the 
receiving  aperture  field  distribution  as  the  r-system,  and  the  system  representing  the  lost  radi¬ 
ation  as  the  /-system. 

The  t-system  is  specified  by  means  of  a  set  of  eigenvectors  of  the  operator  a*a^,  and  the 
r-  and  /-systems  by  means  of  eigenvectors  of  the  operators  a^a^  and  a^a^,  respectively.  Be¬ 
cause  the  r-  and  /-systems  are  independent,  a  state  of  the  t-system  can  quite  generally  be  ex¬ 
pressed  as  follows^ 


(26) 


n  m 
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(N) 

where  the  parameters  C  are  to  be  determined.  It  is  well  known  that  the  various  state  vectors 

n,m 

can  be  expressed  by  means  of  the  state  vector  |  0>t  through  the  relation 

a*N 

|N>t  =  4=  |0>.  ,  (27) 

Vn!  1 


where  a*  is  the  creation  operator  of  the  system,  here  the  t-system.  If  the  same  expansion  is 
used  for  the  r-  and  the  ^-systems  as  well,  the  following  relation  is  obtained 


I  NX 


°>t=n  c<: 


(N) 

m 


*  n  *  m 
ar  af 


\fnh 


|o>  |o> 


(28) 


Because  the  empty  state  of  the  t-system  must  correspond  to  both  r-  and  f-systems  being  in  the 
ground  state  (no  additive  noise),  we  conclude  that 

l °>t  =  l °>r  I °>f  •  (29) 

This  means  that  the  following  identity  must  be  fulfilled  by  the  operators 


,N 


=  EE 


,  (N)  VnT 


n  m 


*  n  *  m 
a*  a: 
r  f 


(30) 


A  similar  relation  between  the  operators  can,  however,  be  derived  directly  from  the  latter  of 
Eq.  (25). 


a* 


N 


=  E  E^y 


N! 


pj;<  np*  m 

r  t 


6  ^  a*na?m 

m,N-n  r  f 


(31) 


Because  the  two  operator  equations  must  be  identical,  we  conclude  that 


r(N)  _  /  N •  ”  x 

n,m  vn!m!  r  t  °m,N -n 


(32) 


We  notice  that  m  +  n  =  N,  which  expresses  the  energy  conservation  of  the  system.  Hence,  the 
expansion  of  (26)  simplifies  to 


|N>t 


N 

y  /  n; 

vn!(N  —  n)! 

n=0 


p  ^  p  ^  ^  ""  ^1 

r  f 


n> 


N  -n>( 


(33) 


From  the  normalization  condition,  it  is  obvious  that 

1  =  (|Pr|2  +  |Pf|2)N  . 


hence, 

I  Pr  I  2  +  |Pf|2  =  1  (34) 

The  transition  probabilities,  i.e.,  the  probabilities  of  reception  of  state  |n>^  |N  —  n>^  when  the 
transmitted  signal  is  in  state  |N>t,  is  given  by  the  square  of  the  projection  of  |n>r  [N  —  n>^  upon 

|N>t, 

p(n|N)  =  |r<n|f<N -n|N>t|2  =  (JJ)  |Pr|2n|Pf  |(N‘n)2  .  (35) 
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The  significance  of  |p  |  is  seen  by  considering  the  transition 

p(l|  1)  =  I  Pr  I  2  =  V  .  (36) 

Hence,  v  must  be  interpreted  as  the  probability  of  receiving  one  photon  when  one  was  sent. 

This,  of  course,  is  equal  to  the  transmission  loss  of  the  propagation  circuit,  or  the  ratio  of  re¬ 
ceived  to  transmitted  power.  Because  of  (34),  we  then  have  for  the  transition  probability  of  the 
propagation  circuit 

p(n|N)  =(|J)  »>n(l  -y)N"n  ,  (37) 

which  can  also  be  derived  from  purely  intuitive  arguments/  As  is  seen  from  Appendix  B,  the 
above  interpretation  of  the  and  P^  is  completely  borne  out  by  the  more  laborious  calculations 
presented  there.  The  transition  probability  p(n|N)  is  seen  to  have  the  appearance  of  containing 
partition  effects.  The  fact  that  a  distinct  transmitted  signal  might  be  received  as  a  number  of 
different  signals  means  that  the  partition  effects  make  the  propagation  circuit  noisy  even  in  the 
absence  of  additive  noise. 

IV.  PROPAGATION  CIRCUIT  AS  A  COMMUNICATION  CHANNEL 

In  the  previous  two  sections  we  have  discussed  the  breaking  up  of  a  signal  into  independent 
elementary  signals,  and  we  have  established  a  transition  probability  for  elementary  signals.  The 
complete  transition  probability  is  identical  to  the  product  of  the  elementary  signal  probabilities. 

In  what  follows,  the  quality  of  the  propagation  circuit  will  be  evaluated  in  terms  of  the  mutual  in¬ 
formation  between  transmitted  and  received  signals.  In  the  optimization  problem  it  will  be  as¬ 
sumed  that  there  is  an  average  power  constraint  on  the  total  signal.  If  the  mutual  information  of 
elementary  signals  does  not  increase  more  rapidly  than  linearly  with  the  signal  energy,  then  the 
average  energy  should  be  distributed  evenly  among  the  elementary  signals  (assuming  the  relative 
bandwidth  of  the  total  signal  to  be  small).  It  is  therefore  sufficient  to  solve  the  optimization  prob¬ 
lem  for  a  single  elementary  signal  and  afterwards  make  sure  that  the  mutual  information  asso¬ 
ciated  with  an  elementary  signal  does  not  increase  more  strongly  than  linearly  with  the  elemen¬ 
tary  signal  mean  energy.  We  begin  by  defining  the  channel  matrix  for  the  elementary  signal 
channel  first  without  additive  noise,  then  with  noise.  We  go  on  to  optimize  the  mutual  informa¬ 
tion  under  average  power  constraint  both  with  and  without  additive  noise,  and  we  show  that  the 
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results  become  identical  to  those  of  Gordon  for  the  case  of  v  1,  i.e.,  when  there  is  no  parti¬ 
tion  noise.  It  is  then  shown  that  the  results  are  improper  whenever  v  <  1  because  the  familiar 
method  with  Lagrange* s  multipliers  employed  in  the  general  analysis  does  not  ensure  that  all 
probabilities  are  positive.  In  order  to  understand  what  happens,  we  construct  certain  simple 
channels  and  discuss  them  in  detail.  From  this  particular  discussion,  it  is  possible  to  general¬ 
ize  the  results  and  draw  conclusions  about  the  channel  capacity  in  the  case  of  extremely  weak 
received  signals. 


t  This  is  olso  of  the  same  form  os  the  result  derived  by  Shimodo,  et  o!.,  for  attenuotion  through  obsorption 
processes. 
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A.  Definition  of  Elementary  Signal  Channel  Matrix 

As  derived  in  the  previous  section,  the  transition  probability  between  signal  state,  N,  at 
the  transmitter  and  signal  state,  n,  at  the  receiver  is 

p(n|N)  -  (jj)  yn(l  -  ^)N"n  .  (38) 

i.e.,  a  binomial  or  Bernoulli  distribution.  We  note  here  that  the  number  of  received  quanta  is 
always  less  than  or  equal  to  the  number  transmitted.  If  the  transition  probability  is  expressed 
in  matrix  form,  therefore,  the  matrix  becomes  triangular.  The  assumption  that  the  transmitted 
signal  can  be  specified  to  an  exact  number  of  quanta  might  seem  rather  unrealistic.  In  practice, 
however,  when  partition  effects  are  important,  the  conditional  probability  (38)  is  rather  insensi¬ 
tive  to  changes  in  N,  so  it  is  sufficient  to  specify  N  approximately.  None  of  the  conclusions  in 
this  paper  will  be  affected  by  a  "diffuse"  specification  of  the  transmitted  signal. 

We  now  turn  to  the  question  of  additive  noise  in  the  path.  The  existence  of  additive  noise 
will  no  longer  guarantee  that  fewer  quanta  arrive  than  were  transmitted  and  the  transition  matrix 
becomes  less  simple.  If  the  noise  is  of  the  equilibrium  type  associated  with  the  temperature,  T, 
then  matters  are  rather  straightforward.  The  probability  of  j  quanta  being  present  in  a  mode 
becomes 

PQ(j)  =  (!  -  e-0)  expl-jG]  with  0  =  ^-  ,  (39) 

and  where  K  is  Boltzmann’s  constant.  When  noise  is  included,  the  transition  probability  becomes 

n 

p'(n|N)  =  £  p(z  |  N)  pQ(n  —  z)  .  (40) 

z=0 


For  later  purposes,  we  shall  express 
trix  is  then  termed  A,  with  elements  a^. 
For  the  noiseless  case,  p( j  |  i)  =  a^. 


this  in  terms  of  a  matrix  product.  The  noiseless  ma- 

The  noise  matrix  is  termed  B,  with  elements  b...^ 

ij 


p(°l 

0) 

0 

0 

0 

p(°l 

1) 

p( 1 1  4) 

0 

0 

p(0  1 

2) 

p(  1 1  2) 

p(2|2) 

0 

If,  in  the  noise  case,  the  same  meaning  is  to  be  attached  to  the  various  elements  of  the  resultant 
transition  matrix  A',  we  must  arrange  the  noise  terms  as  follows: 


t  Note  thot  in  oil  our  motrices  we  count  lines  and  columns  not  from 


unity,  os  usuol,  but  from  zero. 


12 


Po(0)  po(1)  Po(2)  - 


B 


0  Po(0)  P0(1)-' 

0  0  pQ(0)... 


and 


A'  =  AB 


In  passing,  we  observe  that  the  inverse  matrix  can  be  expressed  in  terms  of  the  inverse  of  the 
elementary  matrices 

- 1  - 1  - 1 

(A’)  =  B  aA  1 


B.  Optimization  with  Lagrange  Multipliers 

In  this  section  we  first  carry  out  an  optimization  of  the  mutual  information  as  defined  by 
Fano1^  for  a  general  transition  probability  p(n|N)  which  may  or  may  not  include  noise.  The  cal¬ 
culation  is  carried  out  with  an  average  power  constraint  at  the  transmitting  aperture.  In  this 
situation  our  choice  lies  in  the  selection  of  a  proper  distribution  function  for  the  occupation  num¬ 
ber,  N,  at  the  transmitter,  pt(N),  or  a  distribution  function  pr(n)  at  the  receiving  end.  After  the 
general  solution  to  this  problem  has  been  found,  we  go  on  to  evaluate  the  noiseless  capacity  and 
the  capacity  with  noise  present  (formally  only.1). 

The  mutual  information  for  an  elementary  signal  is  defined  as 

I(N;  n)  =  H(n)  -  H(n  |  N) 


OO 


OO  OO 


=  E  Pr(n)  logPr(n)  +  E  E  Pt<N)  P(°IN)  l°g  p(n  |  N) 

n-0  N=0  n=0 


00  00 


“EE  Pt(N)  p(n|N)  log  E  Pt(N)  p(n|N) 
n=0  N=0  N  =0 


+  E  E  Pt(N)  p(n|N)  logp(n|N) 
n=0  N=0 


(41) 


This  is  to  be  maximized  subject  to  the  two  constraints  on  the  unknown  distribution  function  p  (N) 
00 

Yj  pt(N)  =  1  normalization  , 

N=0 


OO 

Yj  Npt(N)  =  N  average  power  constraint 
N=0 
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Using  the  familiar  methods  of  Lagrange,  with  the  constraints  included  after  multiplication  by 
indeterminate  factors,  we  have  to  form  the  expression 


9 

9pt(N) 


I(N;  n)  -  a'  Yj  Pt<N)  ~  P  Y  Pt(N) 
N  N 


and  equate  it  to  zero.  This  leads  us  to  the  equations 


V  {p(n|  N)  [1  +  logpr(n)  -  log p(n |  N)  ]}  +  a1  +  j3  •  N  =  0  .  (42) 

n=0 

Both  sides  of  the  equation  are  now  multiplied  by  pt(N)  and  a  summation  is  carried  out  over  N. 
This  gives 

00  OO  OO 

-  Yi  Pr(n)  logPr(n)  +  Y  Z  Pt<N)  p(n|N)  logp(n|N)  =  1  +  ot'  +  0N  =  a  +  |3N  .  (43) 

n=0  N  =  0  n=0 

But  this  is,  at  least  formally,  the  maximum  mutual  information 


C 


M 


=  a  +  /3N 


(44) 


The  real  problem  now  lies  in  the  determination  of  the  Lagrange  multipliers  a  and  p.  To  do  this, 
we  have  to  evaluate  the  distribution  function  Pr(n)  or  p^(N)  and  use  the  constraint  equations.  To 
carry  out  this,  we  proceed  as  follows.  Rearranging  Eq.  (42),  we  have 


since 


Y  p(n|N)  logpr(n)  =  Y  p(n|N)  (logp(n|N)  -  a  -0] 
n=0  n-0 


Y  P(n  I N)  =  1 

n=0 


(45) 


We  therefore  have  an  infinite  set  of  linear  equations  in  the  unknowns  x  =  iogp  (n),  one  equation 

t  h  nr* 

for  each  value  of  N.  Regarding  x^  as  the  n  component  of  a  column  vector,  and  also  defining  a 
column  vector  by  the  components 


dN  '  p(n|  N)  [logp(n|  N)  -  a  -  j3N)  ,  (46) 

n=0 

we  may  write  the  set  of  equations  as  follows 


A'  x  =  d  ,  (47) 

where  A'  is  the  matrix  form  of  the  transition  probability  introduced  previously.  The  formal  so¬ 
lution  to  the  problem  therefore  is 


x  =  A-‘1d  or  xn=  Y  <A,'VNdN 


N=0 


(48) 
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For  the  probability  distribution  at  the  receiving  end  we  therefore  have 


pr(n)  =  exp 


E  <A' 


ln=o 


^n,N  dN 


(49) 


There  are  still  the  two  unknowns  a  and  0  to  be  determined,  and  two  equations  are  needed. 
These  are  found  from  the  constraint  equations  in  a  slightly  modified  form,  viz., 

OO  OO 

E  pr<n>  = 1  E  npr<n) =  »»  •  <5°) 

n=0  n-0 


In  the  noiseless  case,  we  are  first  of  all  faced  with  the  problem  of  inverting  the  lower  tri¬ 
angular  infinite  matrix 


10  0 


(1  - 

r) 

V 

0 

(1- 

2 

2r(  1  -  v) 

z 

r) 

V 

(1  - 

r)3 

3r(  1  -  v)Z 

3yZ(l  —  v) 

.  m  ,  .  v m- 1  /m\  2.  .  .  m-2 

(l  -  V)  m(l  -  v)  (  2  j  V  (1  -  v) 


To  avoid  going  into  detail,  we  just  state  the  result.  The  general  term  is  of  the  form 
(A-1).  =  (  j)  (i  —  r)1_J  (— l)1-^  r’1  [note  that  (j)  =  0  for  j  >  ij 

or  explicitly 


(51) 


-(1  -  v)  ^ 


-1 


+  r 


+(l-r)2r-2  —2(1  —  v)  r-2  +r-2 
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With  these  expressions,  we  obtain 


xn  =  logpr(n)=  Yi  (A"\,N  E  p(i|N)  [log p(i |  N)  -  a  -0N] 

N=0  ’  i=0 

=  -a  -  j  [P  -  v  logy -(1  -  v)  log(l  -  y)]  +  Yi  E  <A_1)n>N  P<i  I  N)  log(^)  .  (52) 

B  i 

7 

For  the  particular  case  of  v  =  1,  this  reduces  to  a  result  identical  to  that  of  Gordon 

pr(n)  =  exp  [-a  —  n/3  ]  .  (53) 

Substitution  into  (50)  gives 

a  =  log  ( 1  +  n)  , 

/3  =  log  ( 1  +  n)  —  log n  ,  (54) 

and  the  elementary  signal  entropy  becomes 

CM  =  < 1  +  log  < 1  +  n)  logn  .  (55) 

This  can  be  recognized  as  the  mode  entropy  of  a  system  of  particles  obeying  Bose-Einstein 
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statistics. 

Before  discussing  the  case  of  v  <  1  in  detail,  we  also  write  down  the  formal  solutions  for  the 
noisy  case  and  compare  the  capacity  for  the  case  v  =  1  with  previous  results.  For  equilibrium 
noise,  the  matrix  B  becomes,  in  agreement  with  (39), 


1 

-0 

e 

-20 

e 

-30 

e 

0 

1 

-0 

e 

-20 

e 

0 

0 

1 

-0 

e 

0 

0 

0 

1 

This  has  to  be  inverted  in  order  to  give  the  inverse  matrix  (A'-*)  required  for  the  solution  for 
Pr(n).  The  inverse  of  this  noise  matrix  is  of  the  peculiarly  simple  form 


-1 


1  —  e 


-9 


-0 


(57) 
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”1  - 1 

which  can  be  checked  by  forming  BB  .  Instead  of  the  (A  )  used  above,  we  must  now 
substitute 


( A 1 


\.m  -  2  =  7^  t'A'Vm  - 


e"0(A"1) 


n+l,m J 


(58) 


The  transition  probabilities  must  also  be  modified  in  accordance  with  Eq.  (40)  and  now  become 

n 

irt  /  AT  \  i  TM  _i  _  A 

(59) 


p(n|N)  =  e  Yj  e^(  j*  )  ^(1  —  ^  ( 1  —  e  ®) 

j=0 


This  can  be  checked  for  two  particular  cases.  First,  let  the  noise  disappear,  i.e.,  T  0  and 
hence  0  -*  00 .  Only  one  term  survives  in  the  above  sum,  namely  that  for  which  j  =  n,  and  we 
have 


lim  p(n|N)  =  rn(l  —  v)^*  n  , 


T-0 


(60) 


which  is  the  noiseless  case.  The  other  particular  case  is  that  for  which  v  -*•  1,  i.e,,  the  intrin¬ 
sic  lack  of  determinism  of  the  channel  in  the  absence  of  noise  is  removed,  and  we  obtain 


exp  [— (n  —  N)  0  ]  ( 1  —  e  ®)  n>N 
0  n  <  N 

In  this  case  the  noiseless  channel  matrix,  A,  diagonalizes  and  we  have,  instead  of  (58), 


lim  p(n  |  N)  = 
1 


(A'_1)  =  (B"1) 

n,m  n,m 


(61) 


(62) 


With  these  simplifications,  we  have 

Z  (A,_1)nm  S  PU I  m)  logp(i|m) 


Z  (A,_1)nm  (l-e‘°)  Z  exp  [—  ( i  -  m)  0]  [log(l  -  e“°)  -  (i  -  m)  G] 
m  i=m 

-  Z  (B-1)n,m  [U  +  L)  log(l  +  L)  —  L  log  L] 


=  -  ((1  +  L)  log  ( 1  +  L)  —  L  log  L]  -  -T 

where  L  is  the  mean  noise  occupation  number  defined  by 

-0 


L  = 


-0 


1  —  e 

The  distribution  at  the  receiver  hence  becomes 
p  (n)  =  exp  [—  cc  +L0  —  j3n  —  T] 


(63) 


(64) 


(65) 


This  distribution  is  substituted  into  the  two  constraint  equations 
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(66) 


Y  Pr(n)  =  1  and  Y  npr(n)  =  L  +  S 
n  n 

where  S  is  the  mean  signal  occupation  number.  The  results  for  a  and  p  are 
a  =  ( 1  +  L)  log  ( 1  +  L  +  S)  -  L  log  (L  +  S)  -  T 

p  =  log  ( 1  +  E  +  S)  -  log  (L  +  S)  ,  (67) 

and  the  capacity  becomes 

CM  =  a  +  Sp  =  [(1  +  L  +  S)  log(l  +  L  +  S)  -  (L  +  S)  log(L  +  S)] 

—  [( 1  +  L)  log  (1  +  L)  —  L  log  L]  ,  (68) 

which  is  the  difference  of  the  entropy  of  signal  plus  noise  minus  the  entropy  of  the  noise  alone, 

7 

which  was  to  be  expected,  and  which  is  in  agreement  with  Gordon's  result. 

It  should  be  noted  that  the  function  entering  the  expressions  for  the  capacity  in  (68)  and  in 
(55),  namely, 

f(x)  =  ( 1  +  x)  log  ( 1  +  x)  —  x  log  x  , 

has  a  negative  second  derivative  anywhere  as  long  as  x  >0.  This  means  that  the  elementary 
signal  capacity  increases  less  rapidly  than  linearly  with  energy,  and  hence  the  total  signal  en¬ 
ergy  should  be  distributed  on  the  average  evenly  among  the  various  elementary  signals. 

C.  Mutual  Information  When  Partition  Effects  Are  Present 

In  order  to  evaluate  the  capacity  in  agreement  with  the  previous  section,  the  parameters 
a  and  p  have  to  be  solved  from  the  set  of  equations 


L  pr<n>  = 1  - 

n=0 

OO 

Y  npr(n)  =  L  +  S  ,  (69) 

n=0 


with 


where 


with 


pr(n)  =  exp  [-a  -  —  P  +  Gn(L,  v)  ] 


oo  oo 


Gn(L’  v)  =  Y  Y  <A'  1^n,m  P(i|m)  l°gp(i|m) 

m=0  i=0 


p(i|  m)  = 


1  +  L 


(  L  )L 

V  1 

fl  +  lV  (m\ 

M  +  L' 

L  \ 

'  l  '  '  i 

v) 


m-j 


3=0 


(70) 


(71) 


(72) 
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and  with 


(A'-1) 


n,m 


=  (1  + 


L)  (A-1) 


n,m 


-  UA_1) 


n+l,m 


(73) 


where 


/A-1.  (n\fA  \n-m/  ^n_m  "n  tnA\ 

(A  >n,m  =  v  ■  (74) 

It  was  at  first  thought  that  these  equations  could  be  solved  by  fixing  the  parameters  L  and 
vt  evaluating  Gn(L,  v)  on  an  electronic  computer,  and  then  proceeding  to  solve  the  equations  for 
/ 3  and  a  numerically.  As  an  illustration  of  the  difficulties  then  encountered.  Table  I  presents 
one  of  the  resulting  sequences  obtained. 


TABLE  1 

NUMERICAL  EVALUATION  OF  G  (L, v) 

n 

n 

Cn(L,P) 

n 

Gn(l» 

0 

0 

4 

335.2726 - 

1 

-3.2508297— 

5 

-2524.671 . 

2 

5.9749897— 

6 

19537.6 - 

3 

-49.996700 - 

7 

-154812. . — 

L  =  0  (no  noise),  v  =  l/l 0 - 

It  is  immediately  obvious  that  such  a  sequence  when  substituted  into  pr(n)  in  Eq.  (70)  does 
not  correspond  to  a  valid  physical  situation,  the  main  reason  being  that  a  distinction  occurs  be¬ 
tween  the  probability  of  reception  of  even  and  odd  numbers  of  quanta.  These  difficulties  arose 
at  all  the  combinations  of  parameters  chosen. 

Although  we  were  not  able  to  prove  that  the  following  explanation  is  correct,  it  is  thought 
that  the  subsequent  discussion  and  considerations  of  certain  models  make  this  explanation  very 
plausible.  In  the  calculations  to  evaluate  the  maximum  mutual  information  presented  in  Sec.  IV-B 
there  is  no  constraint  on  the  transmitter  probabilities  Pt(N)  to  ensure  that  they  are  all  positive. 
What  is  believed  to  happen  in  the  case  considered  above  is  that  the  occurrence  of  certain  negative 
transmitting  probabilities  (input  probabilities)  explains  the  ,!unphysicaln  behavior  of  the  receiving 
probabilities  pr(n)  (output  probabilities).  This  explanation  will  become  more  plausible  after  our 
presentation  of  certain  simplified  models  below.  The  described  difficulty  is,  by  the  way,  not  un- 

g 

known  in  information  theory  and  has  been  described  in  detail  by  Muroga.  To  the  knowledge  of 

the  author,  no  satisfactory  method  has  yet  been  developed  in  information  theory  to  handle  the  sit- 

10 

uation.  The  trial-and-error  method  suggested  by  Muroga  and  by  Fano  is  clearly  completely 

impracticable  in  our  case  where  extremely  large  numbers  of  possible  signals  are  available  at  the 

transmitter.  It  is  possible  that  the  extra  constraints  required  to  keep  p  (N)  ^  0  could  be  ensured 

18  ^ 

by  some  linear  programming  technique.  Unfortunately,  we  have  not  been  able  to  develop  an 
adequate  method  here.  In  the  next  section  we  therefore  consider  a  simple  model  where  negative 
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input  probabilities  do  occur,  but  where  the  system  is  so  simple  that  we  can  follow  in  detail  what 
happens. 

D.  Ternary  Model  Channel 

As  the  simplest  possible  example,  consider  the  case  of  the  elementary  signals  only  being 
allowed  to  assume  the  signal  states  N  =  0,  1  and  2,  and  let  there  be  no  additive  noise.  As  a 
channel  matrix,  we  use 


A 


1  0  0 

1  —  v  v  0 

( 1  -  r)2  Zv(  1  -  v)  vZ 


(75) 


which  is  seen  to  correspond  to  the  upper-left  corner  of  the  infinite  channel  matrix  A  of  Sec.  IV-A. 
We  can  use  the  same  procedure  as  there  to  evaluate  the  input  and  output  probabilities.  Because 
there  is  no  average  power  constraint  in  our  simplified  model,  Eq.  (44)  reduces  to 

C  =  a  ,  (76) 

— ► 

and  Eq.  (46)  for  the  column  vector  d  in  this  case  becomes 


d 


+2y(  1  —  v)  log  Z 


(77) 


with  the  following  definition  for 

H  .  =  —  v  log  v  —  ( 1  —  u)  log  (l  —  i^)  (78) 

If  we  note  that  the  inverse  of  our  present  particular  channel  matrix  corresponds  to  the  upper 
three  by  three  corners  of  the  general  inverted  channel  matrix,  we  obtain  immediately  for  the 
output  probabilities  from  Eq.  (49) 


'pr(0)’ 

exp  [— C] 

pr  =  • 

Pjj!) 

>  =  , 

exp  [-C  -(i/v)  HJ 

pr(2)J 

exp  {— C  -  (2/ y)  H1  +  2  [(1  -  v)/v]  log  2} 

We  can  immediately  see  that  these  results  are  not  valid  for  large  transmission  losses,  i.e., 
v  0,  because  for  small  v  it  turns  out  that 

Pr(2)  »  Pr<*>  * 

and  this  does  not  appear  to  be  physically  possible,  whatever  the  input  probability  pt(N).  The 
channel  capacity  is  found  directly  from  the  normalization  condition  on  p^  to  be 


Z0 


C  =  log{l  +  expl-iHj]  +  exp[-i  H1  +  2  log  2  ]} 


(80) 


For  the  transmitted  probabilities  we  obtain 

pt(0)  =  e'C{l  -  ^  expf-i-HJ  +  exp[-|  Hj  +  2  ^  log  2  ]  J  , 

PtU)  =  e‘C{7  exp  [—  -jj  H  ]-2  \  (1  -  v)  exp[-f  H  +  2  log2]}  , 

K  v  J 

pt(2)  =  e_c{^  exp[-f  Hi +  2  1os21}  •  (81) 


It  turns  out  that  the  limiting  values  for  these  probabilities  when  v  —  0  are 
lim  Pt(0)  =  +°° 

lim  p  ( 1)  =  — 00 

lim  p^(2)  =  +°° 

Upon  closer  examination  of  pt(l),  it  is  found  that  this  probability  goes  negative  when 

v  -  v  ~  0.62 
c 


The  p^,  p^.  and  C  are  all  plotted  against  v  in  Figs.  4  and  5.  For  this  model  circuit,  we  there¬ 
fore  experience  a  breakdown  of  the  validity  of  the  general  capacity  formula  as  soon  as  v  becomes 

less  than  a  v  «  0.62. 
c 

When  one  of  the  input  probabilities  goes  negative,  as  just  occurred,  Muroga  proves  that  the 
procedure  to  be  followed  consists  of  putting  one  of  the  input  probabilities  equal  to  zero  and  eval¬ 
uating  the  capacity  again.  If  solutions  can  be  obtained  with  one  of  several  of  the  input  probabili¬ 
ties  equal  to  zero,  the  capacity  is  the  largest  among  these  solutions.  If  no  solution  is  found,  one 
further  input  probability  must  be  equated  to  zero  and  the  procedure  repeated.  It  is  obvious  that 
such  a  procedure  cannot  be  used  in  our  general  case  where  we  might  begin  by  having  enormously 
large  quantum  numbers  at  the  transmitter. 

In  the  elimination  process  just  described  there  will  be  more  output  signals  than  input  signals, 
and  it  becomes  necessary  to  introduce  constraints  on  the  output  signals  in  order  to  be  able  to  get 

g 

definite  relations  between  output  and  input  probabilities.  Muroga  gives  a  general  procedure  to 
be  followed  in  order  to  introduce  the  constraints  among  the  output  probabilities.  Because  of  the 
simplicity  of  the  present  channel  matrix,  we  can  argue  that  when  the  ternary  channel  breaks 
down,  we  obviously  have  to  omit  the  input  (transmitted)  symbol  N  =  1  and  the  channel  matrix  re¬ 
duces  to 


A 


1  0  0 
(1  —  v 2r(l  —  r)  v ^ 


(82) 


This  channel  is  shown  in  Fig.  6.  By  simple  inspection  it  is  obvious  that  the  two  output  symbols, 
n  =  1  and  n  =  2,  can  be  combined  into  one  symbol  without  reducing  our  capabilities  of  inference 
about  the  transmitted  symbols.  The  properly  reduced  channel  matrix  thus  becomes 


A 


1  0 
( 1  —  v)^  1  —  ( 1  —  v)^ 


(83) 


21 


Fig.  4.  Capacity  (natural  units)  af  the  ternary  channel 
platted  against  v . 


V  v 

Fig.  5(a-b).  Output  and  input  probabilities  far  maximum  information 
transfer  in  a  ternary  channel  platted  against  v . 


[3-31-84471 

RECEIVED  n 
0 


Fig.  6.  Graphic  representation  af  the 
channel  [Eq .  (82) ]  . 
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A  straightforward  standard  analysis  shows  that 


C  =  log 


1  +  exp  |log  i/(2  -  v)  +  2  ~~~~y  log  ( 1  -  i/)J 


(84) 


The  output  probabilities  become 


Pr(0)  =  exp  [-C] 


pr(l)  =  exp  [—  C  +  log 2  +  2  _  -  log(l  -  v) 

,2 


pr(2)  =  exp 


c  +  2  I  log  v  +  -  g— ~  1°g<1  -  v) 


(85) 


and  for  the  two  non-zero  input  probabilities,  we  obtain 

*2 


Pt(2)  =  exp  C  +  2  log ( 1  -  i/)j 

pt(0)  =  1  -pt(2)  . 


(86) 


The  capacity  for  this  binary  scheme  is  plotted  against  v  in  Fig.  7,  and  the  different  probabilities 
in  Fig.  8.  Ih  the  binary  case,  we  do  get  valid  results  for  all  values  of  v,  but  for  v  >  =0.62 


V 


Fig.  7.  Capacity  (natural  units)  af  the  binary  channel  platted  against  v. 


V 


Fig.  8.  Output  and  input  probabilities  far  maximum  information 
transfer  in  a  binary  channel  platted  against  v. 
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Fig.  9. 


Capacity  (natural  units)  af  made!  channel  platted  against  v. 


10  06  0  6  04  02  0 


Fig.  10.  Signal  probabilities  required  ta  achieve  capacity  in  model 
channel  platted  against  v . 

the  ternary  encoding  scheme  gives  the  greater  capacity.  In  Fig.  9  we  show  the  maximum  channel 
capacity  for  all  values  of  v  for  the  particular  channel  under  consideration.  Figure  10  gives  the 
probabilities  as  a  function  of  v  for  the  optimum  encoding  scheme.  It  is  reassuring  to  observe 
that  all  the  curves  are  continuous  through  the  value  where  the  change  between  binary  and  ternary 
systems  takes  place. 

E.  Capacity  for  Extreme  Transmission  Losses  When  No  Additive  Noise  Is  Present 

We  now  return  to  the  more  general  case  where  a  very  large  number  of  possible  levels  can 
be  used  both  at  the  input  and  at  the  output  of  the  channel.  For  the  time  being,  we  assume  that 
there  is  a  peak  power  constraint  at  the  input,  i.e.,  N  <  M.  The  general  solution  of  the  capacity 
problem,  as  we  have  seen,  cannot  be  solved  by  any  of  the  methods  known  to  the  author.  It  is  pos¬ 
sible,  however,  to  obtain  the  capacity  in  the  extreme  case  when  v  -+  0. 

The  effect  of  increasing  the  transmission  loss,  i.e.,  decreasing  v,  is  to  decrease  the  num¬ 
ber  of  input  signals  that  can  be  used  in  order  to  achieve  channel  capacity.  This  process  of  de¬ 
crease  in  the  number  of  input  symbols  will  continue  until,  in  the  extreme  case,  only  two  input 
symbols  are  present,  namely,  N  =  0  and  N  =  M,  or  until  the  channel  has  become  binary.  We 
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therefore  conclude  that  for  extreme  transmission  losses,  the  best  we  can  do  is  to  use  an  on  —  off 
system.  When  this  stage  has  been  reached,  the  channel  matrix  assumes  the  form 


A  = 


(1  -  v) 


M 


o 


XM-1 
v)  v 


M-Z  Z 
v  . 


/M\  M 
\M/  V 


(87) 


In  exactly  the  same  way  as  in  the  ternary  channel  discussed  in  Sec.  IV-D,  we  can  conclude  here 
that  n  0  is  one  of  the  output  symbols  and  n  >  1  is  the  other  one.  The  channel  matrix  therefore 
reduces  to 


( 1  -  v) 


M 


1  -  (1  -  v) 


M 


(88) 


Putting 


Hm  =  — M(  1  -  M)M  log (1  v)  [1  (1  I/)M]  log  [1  -  (1  -  i>)M] 


C  =  log  1 1  +  exp  I  - 


1  -  (1  -  v) 


M . 


M  M 


■1! 


=  log  jl  +  [1  —  (1  —  v)  ]  exp 


M(  1  -  v) 


M 

TM 


log  ( 1 


•1  -  (1  -  vV 

For  numerical  computations  it  is  convenient  to  introduce  the  approximations: 
log  ( 1  -  id  «  -  v 


(1  -  v) 


M 


-ldV I 


(89) 


(90) 


This  means  that  we  replace  the  binomial  transition  probabilities  with  Poisson  probabilities,  which 
is  an  extremely  good  approximation  under  the  particular  conditions  considered  here.  The  capac¬ 
ity  now  simplifies  to 


C  «  log  |l  +  (1  —  e  l>^)  exp 


vM 


v  M  , 
e  —  1 


(91) 
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This  type  of  channel  has  previously  been  considered  by  R.  C.  Jones  who  evaluated  information 
transfer  for  particular  input  probabilities  but  did  not  optimize  the  information  transfer  with  re¬ 
spect  to  the  input  probabilities  as  we  have  done  here.  The  output  probabilities  implied  in  (91)  are 

pjo)  =  e'c  , 


Pi  (n  >0)  =  1  —  e 


and  the  input  probabilities  are 


pr(n  >  0) 


Pt(M)  p(n  >  0/M) 

pt(0)  1  -  pt(M) 


(92) 


1  ~Pr(0) 

1  —  exp  [—  rM  | 


(93) 


Z5 


It  would  be  of  some  interest  to  evaluate  the  probabilities  of  receiving  a  certain  number  of 
quanta,  n.  In  the  Poisson  approximation,  this  becomes 


Pr(n)  =  Pt(M) 


(fM)n  -I'M 
n! 


n  ^  1 


(94) 


Figure  11  shows  the  capacity  as  a  function  of  rM  for  the  binary  case  just  considered.  As 
rM  increases,  an  asymptotic  value  is  reached  corresponding  to  the  binary  channel  noiseless 
case.  Note  that  it  is  only  for  rM  less  than  some  number  near  unity  that  the  curve  shown  gives 
the  actual  peak  power  limited  capacity.  When  rM  is  slightly  larger  than  this  (undetermined!) 
number,  it  would  presumably  be  advantageous  to  use  a  ternary  channel.  For  even  larger  rM  the 
number  of  input  symbols  used  should  be  even  further  increased  to  optimize  the  capacity.  Unfor¬ 
tunately,  mathematical  difficulties  have  prevented  us  from  establishing  the  exact  values  where 
the  transitions  between  the  various  types  of  encoding  procedures  should  take  place. 


vM  «  P 


Fig.  11.  Binary  channel  capacity  for  peak  power  I  imitation . 


Fig.  12.  Input  probabilities  for  binary  channel . 
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Fig.  13.  Output  probabilities  for  binary  channel. 


Fig.  14.  Relation  between  vM  and  vM.  Fig.  15.  Curve  A  shows  the  chonnel  capocity 

agoinst  yMwhen  the  chonnel  is  binary.  Curve  B 
shows  the  no-partition  noise  capacity. 
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Finally,  in  order  to  provide  a  basis  for  the  comparison  of  the  quantized  channel  capacity 
with  partition  noise  with  the  quantized  channel  capacity  without  partition  noise  as  considered  by 
Gordon/  we  have  to  establish  the  binary  capacity  under  average  power  constraints.  Rather  than 
going  through  the  whole  optimization  procedure  again,  we  use  the  results  of  the  above  peak  power 
limited  case.  When  M  is  the  "on-level11  of  the  circuit,  we  have  already  established  the  capacity 
[see  Eq.  (91)].  Let  us  denote  this  capacity  by  C(M).  For  this  particular  value  of  M  we  have  for 
Pt(M) 


Pt(M)  = 


1  -  exp  [—  C(M)  ] 
1  —  exp  [—  I'M] 


(95) 


The  average  power  constraint  requires  that 


M  •  pt(M)  =  M 


(96) 


where  M  is  the  average  number  of  quanta  transmitted  in  an  elementary  signal.  Between  Eqs.  (95) 
and  (96),  we  can  solve  for  I'M  =  g(yM),  and  hence  we  can  find  the  binary  capacity  C  as  a  function 
of  (I'M).  The  relation  between  I'M  and  I'M  is  shown  in  Fig.  14.  In  Fig.  15  is  shown  the  capacity 
as  a  function  of  I'M,  that  is  the  mean  number  of  received  signal  quanta.  For  comparison,  we 
also  show  Gordon's  "noiseless”  capacity  curve  [see  Eq.  (55)].  It  is  immediately  obvious  that  the 
partition  noise  reduces  the  channel  capacity  very  considerably.  In  order  to  put  this  statement 
on  a  more  quantitative  basis,  we  inquire  about  the  amount  of  additive  noise  which  would  make  the 
no-partition  noise  capacity  curve  corresponding  to  (68)  have  the  same  slope  at  the  origin  as 
curve  A  in  Fig.  15.  The  equivalent  noise  occupation  number  turns  out  to  be 


N  =  0.745  .  (97) 

The  asymptotic  value  of  the  partition  noise  as  v  -+  0  therefore  is  equivalent  to  a  rather  substan¬ 
tial  additive  noise. 


V.  CONCLUSIONS 

In  this  report,  we  have  discussed  several  problems  arising  in  the  communication  by  means 
of  electromagnetic  waves  when  quantum  effects  come  into  play.  In  Sec.  II  the  problem  of  describ¬ 
ing  a  signal  was  discussed.  The  familiar  idea  of  sampling  as  used  in  the  classical  theories  was 
rejected  as  a  basis  for  a  signal  description.  Instead,  the  signal  was  divided  into  elementary  sig¬ 
nals  which  can  be  studied  by  means  of  a  cross-correlation  detection  procedure  which  makes  use 
of  the  total  elementary  signal  energy.  Certain  quantum  mechanical  considerations  showed  that 
the  elementary  signals  can  be  specified  to  an  integer  number  of  quanta  of  energy.  It  was  also 
shown  that  a  composite  signal  can  be  subdivided  into  approximately  BT  independent  elementary 
signals,  B  being  the  bandwidth  and  T  the  duration  of  the  total  signal. 

In  Sec.  Ill  a  relation  between  transmitted  and  received  signals  was  established  in  agreement 
with  quantum  theory,  and  it  was  shown  that  a  partition  noise  effect  shows  up  in  a  proper  deriva¬ 
tion  of  the  signal  channel  matrix. 

In  Sec.  IV  we  first  attempted  to  derive  the  channel  capacity  by  using  the  familiar  Lagrange 
multiplier  optimization  scheme.  This  procedure  was  shown  to  break  down  because  the  occur¬ 
rence  of  radiative  losses  requires  the  number  of  signal  possibilities  to  be  reduced  at  the  trans¬ 
mitter  in  order  to  achieve  maximum  capacity.  It  was  furthermore  established  that  under  extreme 
transmission  losses  the  optimum  encoding  procedure  tends  to  become  purely  binary.  For  this 
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extreme  case  the  channel  capacity  was  established  with  both  peak  and  average  power  constraints. 
It  was  shown  that  partition  noise  effects  become  rather  important  under  these  extreme  conditions 
and  an  equivalent  additive  noise  power  was  established. 

It  should  be  emphasized  that  the  results  described  in  this  report  are  not  complete  because  in 
several  places  general  conclusions  have  been  drawn  from  particular  examples.  All  the  problems 
in  connection  with  the  occurrence  of  negative  input  probabilities  should  be  put  on  a  firmer  basis, 
and  the  transition  regions  between  the  various  transmission  encoding  schemes  should  be  estab¬ 
lished  in  order  to  evaluate  the  true  capacity  under  more  general  conditions.  Another  point  which 
deserves  more  careful  attention  is  the  meaning  and  nature  of  the  measuring  procedure  involved 
in  establishing  the  commutation  relation  for  the  signal  parameters  B  and  B *  (see  Appendix  A). 
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APPENDIX  A 

QUANTIZATION  OF  AN  ELEMENTARY  PLANE  WAVE  SIGNAL 


Starting  with  an  elementary  plane  wave  as  in  (6),  we  obtain  for  the  Lagrangian  density  of  an 
elementary  signal 


€  /3A  \2  .  /oj\  w 

12  13 

the  variable  eanonieally  conjugate  to  A  is  found  to  be 


/ 9  A  \2 


(A  -1) 


9L 


9A 


x  a,  9A  x  o  3t 
d(-dT] 


and  the  Hamiltonian  density  becomes 


(A -2) 


h.  *  p?+*(!V 

2c  2 ui  '  9z  1 

o  o 


(A-3) 


In  order  to  evaluate  the  total  Hamiltonian,  it  is  convenient  to  express  an  elementary  signal 
in  the  following  way 


where 


a  pk  +(Ak/2) 

A  (z,  t)  -  —y-  \  °  {b  exp  [ik(z  —  et)]  +  B*  exp  [—  ik(z  —  et)]}  dk  , 

x  Jko-(Ak/2) 


k  =  — 
o  e 


Ak 


2tt 


(A-4) 


B  =  y  (a  +  i/3) 

In  this  representation,  the  eanonieally  conjugate  variable  becomes 
e  p  k  +(Ak/2) 

P  (z,  t)  =  —  ie  \  °  k  {  B  exp  [ik(z  —  ct)  ]  —  B  *  exp  [—  ik(z  —  ct)  ]}  dk  (A  -5) 

x  Ak  Jko-(Ak/2) 

When  computing  the  total  Hamiltonian,  we  have  to  limit  the  wave  in  the  plane  perpendicular  to 
the  z-axis  beeause  of  the  infinite  extent  of  a  plane  wave.  Let  us  imagine  a  rectangular  shape  of 
area  L^L^.  This  somewhat  artificial  limiting  of  the  wave  can  be  avoided  if  other  types  of  spatial 
modes  are  considered.  The  total  Hamiltonian  is  evaluated  as  follows 


H  =  L  L  C  Hdz 
x  y  J-oo 


2  o 


Ak 


kQ+(Ak/2) 

LxL  CJ  dkdk'kk'  dz  {  — BB*  exp  [i(k  —  k')  (z  —  ct)] 

k  -(Ak/2) 


-  B*B  exp  [— i(k  -  k')  (z  -  ct)]} 
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?  6 

H  =  c  — %  L  L  2tt 


kQ  +  (Ak/2) 


Ak 


x  y 


dkdk'kk'6(k  —  k')  (BB*  +  B*B) 


kQ-(Ak/2) 


O  € 

=  c 2  L  I 


Ak 


>k  +(Ak/2) 


p  K  H 

y  ^  ^  2tt(BB*  +  B*B)  \  °  dk 

x  y  JkQ-(Ak/2) 


=  L  L  cTe  u? 
x  y  o  o 


1  + 


±(_!L_\2 

3  \«  T/ 


(BB*  +  B*  B) 


(A-6) 


Next  we  have  to  establish  a  commutation  rule  for  the  B  and  B*  from  general  quantum  principles. 
We  base  the  quantization  procedure  on  the  following  commutator  relation  (see  for  instance  Ref.  13) 


[A  (z,t),P  (z\t)]  =  <5(z-z')  i* 

x  y 


([  ]  =  Poisson  bracket) 


(A-7) 


In  order  to  establish  the  desired  commutation  relation,  we  have  to  express  B  and  B*  in  terms 
of  A  and  P.  This  solution  for  B  and  B*  cannot  be  made  in  an  arbitrary  way.  It  turns  out  that 
the  commutator  of  B  and  B*  depends  on  the  way  in  which  they  are  related  to  the  field  repre¬ 
sented  by  A  and  P.  If  this  seems  strange,  we  must  realize  that  the  quantities  B  and  B*  only 
acquire  a  physical  meaning  provided  we  devise  an  experiment  to  measure  the  quantities.  When 
we  relate  B  and  B*  to  A  and  P,  we  must  regard  this  as  a  specification  of  an  experiment  for 
their  observation.  The  dependence  of  the  commutator  upon  the  representation  of  B  and  B*  there¬ 
fore  means  that  all  experiments  designed  to  evaluate  B  and  B*  are  not  equally  good. 

It  seems  that  the  best  one  can  do  is  to  evaluate  B  and  B*  by  the  following  formulas 


B 


.  p  k  +(Ak/2)  p 
=  T~\  dk  \ 


B 


ko-(Ak/2) 
k  +(Ak/2) 


.  pk  p 

*  =  ~  \  0  dk  \ 

47r  \  -(Ak/2) 


dz 


dz 


A  (z,  t)  + 
x 


A  (z,  t)  — 
x  '  ' 


ik 


€  ck  g 
o  o & 


2- 


ik 


c  ck  g 
o  o & 


2-  Px'2’1) 


exp  [— ik(z  —  ct)  ] 


exp  [ik(z  —  ct) ] 


(A  -8) 


r  1 

where  g  -  {l  +  j  (7t/Tcoq)  }.  It  is  obvious  that  these  equations  can  be  derived  directly  from  the 
representations  (A-4)  and  (A-5).  The  detailed  nature  of  the  corresponding  observations  will  not 
be  discussed  here,  but  it  appears  that  the  representations  for  B  and  B*  correspond  to  a  cross- 
correlation  experiment  where  the  cross -correlative  waveform  is  of  exactly  the  same  nature  as 
the  signal  waveform.  For  the  commutator  of  B  and  B*  we  obtain 


[B,  B*] 


*  M 


dkdk'dzdz '  |  — 


2  /{ k  [A  (z,t),P  (z'.t)] 

€  ck  g 

r\  ° 


+  k'  [A  (z1,  t),  P  (z,  t)]}  exp  [ik'(z'  -  ct)  -  ik(z  -  ct)  ] 


l6r^e  ck^gL  L 
o  o  x  y 


dzdkdk’dz ’  [kitf6(z  -  z')  +  k'itf6(z'  -  z)  ] 


x  exp  [ikr (z 1  —  ct)  —  ik(z  —  ct)] 
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[B,  B*]  - 


fi 


L  L  167 r^e  ck^f 

x  y  o  e 


^  ^  dkdk'(k  +  k')  dz  exp  [i (k 1  —  k)  (z  -  ct)  ] 


L  L  87re  ck  g 
X  y  o  b 


dkdk’(k  +  k')  6 ( k 1  —  k) 


L  L  cT  Ze  (jo  g 
x  y  o  o6 


(A  —9 ) 


To  simplify  even  further,  we  put  V  =  L  L  cT  which  equals  the  volume  occupied  by  the  signal, 

x  y 

and  obtain 


/  cj  g 

a  =  /  V  *  — B 


y2f  a)  g 

V  -  — Tjr-  B*  • 


(A-10) 


Substituting  these  new  operators  into  (A -6),  we  obtain  the  familiar  expression  for  the  elementary 
signal  Hamiltonian 

H  (aa*  +  a*  a)  =  4iu)^(a*a  +  y)  (A- 11) 

14 

It  is  shown  in  many  standard  texts  on  quantum  mechanics  that  a  convenient  set  of  basis  vectors 
for  the  description  of  such  a  system  is  formed  by  the  eigenvectors  of  the  operator  a* a,  viz., 

|  0 >,  |  1>,  .  .  .  |n>.  It  is  also  shown  that  the  corresponding  eigenvalues  are  the  integers  0,  1.  .  .  n, 
respectively.  The  creation  operator,  a*,  and  the  annihilation  operator,  a,  are  completely 
specified  by  the  relations 

a*  |  n>  -  \l n  +  1  |  n  +  1  >  , 

a  |  n  >  -  \rt  i  |  n  —  1  >  , 

a  |0>  -  0 

With  this  we  have  reached  a  valid  quantum  description  of  the  elementary  signals  and  hence  of  a 
composite  signal.  We  observe  that  only  a  discrete  set  of  states  of  the  elementary  signals  is 
possible;  in  fact,  an  elementary  signal  at  center  frequency  a;Q  can  be  specified  to  an  accuracy 
in  energy  of  Similar  conclusions  can  be  reached  for  other  types  of  elementary  signals. 
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APPENDIX  B 

DIRECT  EVALUATION  OF  P 

r 

In  order  to  evaluate  P^,  we  compare  the  two  signal  representations  at  the  transmitting 
aperture  plane.  To  avoid  unnecessary  complications,  we  compare  terms  with  identical  time- 
variation. 

From  Eqs.  (9)  and  (11)  we  have  in  the  t-representation 


A(x.y.k)  -  (-L)2  [  \  dkxdkyFt(kxky)  Bt  exp[i(kxx  +  kyy)] 


+  complex  conjugate 


(B-l) 


The  function  F^k^k^)  is  defined  in  (23).  In  the  r  —  /  representation,  the  terms  with  similar 
time-variation  are  [see  Eq.  (24)] 


H'2  il  dkx' 


dk  F  (k  k  )  B  •  exp 
y  r  x  y  r  r 


i[kxx  +kyy-zoJkZ-  (kf  +  k2)  ]  J 


+  similar  /  -term  .  (B-2) 

Integration  of  both  (B-l)  and  (B-2)  over  the  transmitting  aperture  gives  the  equation 

\  \  dk  dk  F.(k  k  )2  B.  =  \  \  dk  dk  F  (k  k  )  F.(k  k  )  expf-iz  J  k2  -  (k  2  +  k2)  B 
J  J  x  y  V  x  y'  t  J  J  x  y  r'  x  y'  V  x  y'  1  [  o  V  x  y  J  r 


+  similar  I -term 

The  left  side  can  be  integrated  over  k  and  k  directly,  giving 

x  y 

at( 2tt)2  Bt  . 


(B-3) 


(B-4) 


In  order  to  make  the  right-hand  side  easy  to  evaluate,  we  make  the  assumption  that  the  exponen¬ 
tial  factor  varies  much  more  rapidly  than  the  F^F^  factor.  Only  contributions  near  k^  k^  -  0 
will  be  important,  and  we  obtain  the  approximate  equation 


Bt  *  “r  ^  dkxdky  '  exp[-izo  J* 


2  (k2  t  k2)  |  B 

x  y  1  ’ 


+  similar  I -term 


(B-5) 


Because  k^  and  k^  can  deviate  but  little  from  zero,  the  square  root  can  be  expanded  to  give 


Bt  *  <T7)2  “r  '  exPt-ikzol  dkxdky  '  exP  izo(~Sk  7 ■) 


B 


+  similar  /-term  a 

2t r  r 


i  exp  [— ikzQ]  •  —  Br 
o 


+  similar  /-term 

Finally,  we  introduce  the  operators  a  and  a*  through  the  Eq.  (A- 10) 


Bt  = 


* 


cTa,2e  a;  g  t 
too6 


(B-6) 
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and  obtain 


r  1  . 

r  /  cTa  2e  oo  g  ar 
n/  r  o  o& 


i 

a,  =  Jot  .a  •  exp  f— ikz  ]  —  •  a  +  t  -term 
t  27r\/tr  oJz  r 

o 

We  therefore  conclude  that 


P  |2  =  (>L)2^r 

r1  ■  {Zn'  2 


which  is  just  the  transmission  loss  in  a  propagation  circuit  (see  for  instance  Ref.  20). 


(B-7) 


(B-8) 
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